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(Dated: February 4, 2008) 

Abstract 

This report describes a measurement of the top quark mass, Mf op , with the dynamical likelihood 
method (DLM) using the CDF II detector at the Fermilab Tevatron. The Tevatron produces 
top/anti-top (tt) pairs in pp collisions at a center-of-mass energy of 1.96 TeV. The data sample 
used in this analysis was accumulated from March 2002 through August 2004, which corresponds 
to an integrated luminosity of 318 pb -1 . We use the tt candidates in the “lepton+jets” decay 
channel, requiring at least one jet identified as a b quark by finding a displaced secondary vertex. 
The DLM defines a likelihood for each event based on the differential cross section as a function of 
Mtop per unit phase space volume of the final partons, multiplied by the transfer functions from jet 
to parton energies. The method takes into account all possible jet combinations in an event, and 
the likelihood is multiplied event by event to derive the top quark mass by the maximum likelihood 
method. Using 63 tt candidates observed in the data, with 9.2 events expected from background, 
we measure the top quark mass to be 173.2 +24 (stat.) ± 3.2 (syst.) GeV/c 2 , or 173.2 GeV/c 2 . 

PACS numbers: 14.65.Ha, 12.15.Ff 
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I. INTRODUCTION 


The top quark mass is an important quantity in particle physics. Its precise value not 
only serves for setting basic parameters in calculations of clectroweak processes, but also 
provides a constraint on the mass of the Higgs boson. Therefore it is desirable to have a 
measurement with a precision comparable to that of other relevant clectroweak parameters, 
typically of the order of 0.1% — 1%, the latter corresponding to about 2 GeV/c 2 in M top . 
Based on Run I Tevatron data at a center-of-mass energy of 1.8 TeV (1992-1996), the CDF 
and D0 collaborations published several direct experimental measurements of M top with all 
decay topologies arising from tt production: the dilepton channel [1, 2], the lepton+jets 
channel [3, 4], and the all-jets channel [5, 6]. Including a recent reanalysis of D0 Run 
I data [7], the Run I world average top quark mass is 178.0 ± 4.3 GeV/c 2 [8]. A global 
standard model fit using this updated value gives the most likely value of the Higgs boson 
mass of 129 tig GeV/c 2 , and the 95% C.L. upper limit of 285 GeV/c 2 [9]. 

In this paper we present a measurement of the top quark mass in pp collisions at a/s = 1.96 
TeV at the Fermilab Tevatron. The data were obtained with the upgraded Collider Detector 
at Fermilab (CDF II) operated during Run II. The integrated luminosity of the data sample, 
collected from March 2002 through August 2004, is 318 pb” 1 . This is the total dataset for 
which all detectors including the silicon tracker were operating. The method employed is 
the Dynamical Likelihood Method (DLM) [10]—[13], which uses the differential cross section 
for the tt process as a function of M top in the likelihood definition. The permutations in 
assigning jets to the primary quarks from the t and t decays and the quadratic ambiguity in 
the ^-component of the neutrino momentum are incorporated in the likelihood for an event, 
and the likelihood is multiplied event by event to extract M top by the maximum likelihood 
method. Similar but not identical techniques were introduced and employed during Run 
I [7, 14, 15]. Using Run II data, CDF recently produced the best single top quark mass 
measurement using the template method with in situ jet energy calibration [16]. That 
analysis and this one are summarized together in [17]. 

In this method, we assume that the standard model (SM) accurately describes tt produc¬ 
tion and decay. This assumption is justified for two reasons: (1) Since the discovery of the 
top quark was established in Run I [18], its properties have been investigated using both 
Run I and Run II data, but no significant discrepancies between experimental data and the 



SM have been found [19]—[21]. (2) The SM neither predicts the top quark mass directly, nor 
explains why it is approximately 40 times more massive than the b quark, the isodoublet 
partner of the top quark. Therefore it is reasonable to use the SM in the likelihood definition 
(through the differential cross section) for the mass measurement. 

According to the SM, the top quark decays approximately 100% of the time into a W 
boson and a b quark. The W then decays to a quark-antiquark or lepton-neutrino pair. 
The measurement presented here uses events with ti decaying in the “Icpton+jets” channel, 
tt —> W + W~bb —> luqq'bb , as shown in Fig. 1, which provided the most accurate mass 
measurement in Run 1 because of higher statistics than the dilepton channel and lower 
background than the all-jets channel. This channel is characterized by a single high px 
lepton (electron or muon) and missing transverse energy from a W —> Iv decay, plus four 
jets, two from the hadronically decaying W boson and two b quarks from the top decays. 
The b quarks may be identified (“6-tagged”) by reconstructing secondary vertices from the 
decay of B hadrons with the silicon vertex detector (SECVTX tagging), as described in 
Section IV C. 



FIG. 1: Tree-level Feynman diagram of standard model tt production and decay in the lepton+jets 
mode. 

This paper is organized as follows. In Section II, we present a brief description of the most 
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important detector subsystems to this analysis. Section III describes the data samples that 
are used in the top quark mass measurement. Section IV presents particle identification and 
event selection. The background estimates are described in Section V. After a brief overview 
of the DLM procedure in Section VI, the definition of the dynamical likelihood function and 
the reconstruction procedure are discussed in Section VII. The transfer functions between 
jet and parton kinematics, which play a key role in the method, are presented in Section VIII. 
Section IX describes top quark mass determination studies using Monte Carlo for both the 
ti signal and background events. The effect of background on the likelihood distribution is 
also investigated. Section X presents the final top quark mass result after correcting for the 
mass-pulling effect of the background. Section XI discusses further checks on this analysis. 
The systematic uncertainties are presented in Section XII. Conclusions are summarized in 
Section XIII. 

II. THE CDF DETECTOR OVERVIEW 

The CDF II detector, a general purpose detector with azimuthal and forward-backward 
symmetry, is composed of independent subsystems designed for distinct tasks relating to the 
study of pp interactions. 

The CDF coordinate system consists of the 2 -axis along the proton beam direction, the 
azimuthal angle (j) defined in the plane transverse to the 2 -axis, and the polar angle 9 from 
the proton direction (usually expressed as the pseudorapidity p = — ln(tan(#/2))). The 
x- and y -axes point outward and upward from the Tevatron ring, respectively. Transverse 
energy ( Et ) and momentum (px) are defined in this plane, perpendicular to the 2 -axis. 

The three most relevant subsystems to ti —> lepton+jets event detection are the tracking 
chambers, the calorimeters and the muon chambers. These subsystems are briefly described 
below. A complete description of the CDF II detector can be found elsewhere [22], 

The tracking system consists of a large open-cell drift chamber and silicon microstrip 
detectors. These he inside a superconducting solenoid of length 5 m and diameter 3.2 m, 
which produces a 1.4 T magnetic held aligned coaxially with the beampipe, and are used 
for measuring charged particle momenta. The outermost system, the Central Outer Tracker 
(COT) is a 3.1 m long open-cell drift chamber which provides 96 position measurements in 
the radial region between 0.43 and 1.32 m [23] and in the pseudorapidity region \rj\ < 1.0. 
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Sense wires are arranged in 8 alternating axial and ± 2 ° stereo super layers with 12 wires 
each. The position resolution of a single drift time measurement is approximately 140 /im. 
Between the interaction region and the COT, there are three separate silicon detectors. In 
combination, silicon detectors provide high resolution position measurements for charged 
particles out to \rj\ = 2.0. The innermost device, Layer 00 [24], is a single-sided layer of 
silicon microstrip detectors mounted directly on the beampipe at a radius of 1.6 cm that 
provides an axial measurement as close to the collision point as possible. Between the COT 
and Layer 00, a five layer double-sided silicon detector (SVXII) covers the radial region 
between 2.4 and 10.7 cm [25]. Three separate SVX barrel modules are located along the 
beamline, covering a length of 96 cm. Three of the five layers combine an r-tp measurement 
on one side and a 90° stereo measurement on the other, and the remaining two layers combine 
r-(f) with small-angle stereo at ±1.2°. The typical hit resolution is 11 //m. Three additional 
layers of double-sided silicon strips, the Intermediate Silicon Layers (ISL), are located at 
larger radii, between 19 and 30 cm, and provide good linking between tracks in the COT 
and SVXII [26], 

Outside of the tracking systems and the solenoid, segmented electromagnetic (EM) and 
hadronic (HAD) sampling calorimeters are used to reconstruct electromagnetic showers and 
jets in the pseudorapidity interval \rj\ < 3.6 [27]-[29]. The calorimeters are segmented 
into projective towers of size 7.5-15° in <p and 0.1 in 77 . At the front of each tower, a 
lead-scintillator sampling electromagnetic calorimeter, 18 radiation lengths deep, records 
the energy of electromagnetic showers. In the central region (|y/| < 1.0), a layer of mul¬ 
tiwire proportional chambers (CES) measures the transverse shower profile at a depth of 
the maximum shower development. Behind the electromagnetic calorimeter is the hadronic 
calorimeter with roughly 5 absorption lengths of alternating layers of steel and scintillator. 

High p T muons used in this analysis are detected in three separate subdetectors. Two 
separated drift chambers cover the region \r)\ < 0.6: Directly outside of the hadron calorime¬ 
ter, four-layer stacks of planar drift chambers (CMU) detect muons with px > 1.4 GeV/c 
which penetrate the five absorption lengths of the calorimeter [30]. Behind another 60 cm 
of steel, an additional four layers (CMP) detect muons with p T > 2.0 GeV/c [31]. An ad¬ 
ditional system with 4 drift chamber layers and scintillation counters occupies the region 
0.6 < \rj\ < 1.0 (CMX), completing the muon coverage over the full fiducial region of COT 
tracking, \p\ < 1 . 0 . 
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III. DATA SAMPLES 


A. Luminosity and Triggers 

The results reported here are based on the data recorded during the period March 2002- 
August 2004, when the average instantaneous Tevatron luminosity was approximately 4 
x 10 31 cm _2 s _1 , and the highest was about 10 x 10 31 cm“ 2 s -1 . The recorded integrated 
luminosity for this period is 318 ± 19 pb _1 for electron and CMU/CMP muon analysis, and 
305 ± 18 pb' 1 for CMX muon analysis. 

CDF employs a three-level trigger system. We describe only the triggers important 
for this analysis, which select events containing a high momentum electron or muon. For 
electron candidates, the first level (LI) trigger requires a track with pr > 8 GeV/c matched 
to an EM calorimeter cell with Et > 8 GeV, and a ratio of hadronic to electromagnetic 
energy (F H ad/F EM ) less than 0.125. Calorimeter clustering is done in the second level (L2) 
trigger, which requires a track with pr > 8 GeV/c matched to an EM cluster with Et > 16 
GeV. At the third level (L3), a reconstructed electron with E T > 18 GeV is required. For 
muon candidates, a track with pr > 8 GeV/c matched to muon stubs in the muon chambers 
(CMU, CMP, or CMX) is required for LI and L2; the L3 trigger requires a Pt > 18 GeV/c 
track. 

B. Monte Carlo Programs 

The generation of tt events relies mainly on the ffERWIG v6.505 [32] and PYTfflA 
v6.216 [33] Monte Carlo programs, which employ leading order QCD matrix elements for the 
hard process, followed by parton showering to simulate gluon radiation and fragmentation. 
The CTEQ5L [34] parton distribution functions are used. For heavy flavor jets, the decay 
algorithm QQ v9.1 [35] is used to provide proper modeling of bottom and charm hadron 
decays. The ALPGEN vl.3 program [36], which generates high multiplicity parton final 
states using exact leading-order matrix elements, is used in the study of backgrounds. The 
parton level events are then passed to ffERWIG and QQ for additional QCD radiation, 
fragmentation and B hadron decay. 

The CDF II detector simulation [37] reproduces the response of the detector to par¬ 
ticles produced in pp collisions. Tracking of particles through matter is performed with 
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GEANT3 [38]. Charge deposition in the silicon detectors is calculated using a parametric 
model tuned to the existing data. The drift model for the COT uses the GARFIELD pack¬ 
age [23], with the default parameters tuned to match COT data. The calorimeter simulation 
uses the GFLASH [39] parameterization package interfaced with GEANT3. The GFLASH 
parameters are tuned to test-beam data for electrons and pions, and are checked by com¬ 
paring the calorimeter energy of isolated tracks in pp collision data to their momenta as 
measured in the COT. 

IV. PARTICLE IDENTIFICATION AND EVENT SELECTION 

A. Lepton Identification 

The identification of charged leptons produced by W decay provides the initial selection of 
the tt —> lepton+jets sample. After passing the trigger requirements, electron candidates are 
identified by requiring the electrons to be in the central pseudorapidity region of the detector 
(I hi A 1) and to have an EM cluster with E? > 20 GeV and a track with p t > 10 GeV/c. 
Several variables are used to discriminate against charged hadrons and photon conversions. 
We require that the extrapolated track match the shower location as measured in the CES, 
that the ratio of hadronic to electromagnetic energy in the calorimeter cluster, Eu^/Eem, be 
less than 0.055 + 0.00045 x Eem, and that the ratio of cluster energy to track momentum, 
E/p , be less than 2.0 (unless pt > 50 GeV/c, in which case this cut is not applied). The 
isolation variable, defined as the ratio of the additional energy deposited in a cone of radius 
A R = a/At/ 2 + A (j) 2 = 0.4 around the electron cluster to the electron energy, is required 
to be less than 0.1. Conversion electrons are removed by rejecting events that have a pair 
of opposite electric charge tracks (one of them the electron) in which the distance A (xy) 
between the tracks in the r-<p plane (at the conversion point) is less than 0.2 cm, and the 
difference between the polar angle cotangent of the two tracks, |Acot#|, is less than 0.04. 
Fiducial cuts on the electromagnetic shower position in the CES ensure that the shower 
is located in a well-understood region of the calorimeter. For isolated high momentum 
electrons from W decay, the tracking efficiency is measured to be 99.93 ^q; 35% [40]. The 
transverse energy can be measured from the electromagnetic cluster with a precision a / Et 
= 13.5%/a/-E r(GeV) © 2 % [27], 
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Muon candidates are identified by extrapolating COT tracks to the muon detectors. Two 
types of high-pT muon samples are used in this analysis. CMUP muons (\rj\ < 0.6) have a 
COT track linked to track segments in both CMU and CMP. A CMX muon (0.6 < |p| < 1.0) 
has a COT track linked to a track segment in the CMX. For both CMUP and CMX muons, 
we require that the COT track has px > 20 GeV/c, and that the energy in the calorimeter 
tower containing the muon is consistent with the deposit expected from a minimum ionizing 
particle. The latter rejects secondary particles in calorimeter hadron showers that produce 
tracks in the muon chambers. An isolation variable is defined as the ratio of the total 
energy deposited in a cone of radius A R = 0.4 around the muon track candidate (excluding 
the towers the muon passed through) to the track momentum, and is required to be less 
than 0.1. Backgrounds from cosmic rays are removed by requiring that the distance d of 
closest approach of the reconstructed track to the beam line be less than 0.2 cm. For high 
momentum COT tracks, the resolution at the origin is Sz ~ 0.5 cm along the beamline 
and 5d zz 350 pm (^ 40 pm with SVXII) for the impact parameter in the transverse plane. 
Additionally, the distance between the extrapolated track and the track segment in the muon 
chamber is required to be less than 3, 5 and 6 cm for CMU, CMP and CMX respectively. 
COT tracks are required to have at least 3 axial and 2 stereo layers with at least 5 hits 
each for both electron and muon candidates. From the COT, the transverse momentum 
resolution for high momentum particles is found to be 5px/px ~ 0.15% x p T (GeV/c). 

B. Jet Corrections and Systematics 

Jet reconstruction in this paper employs a cone cluster algorithm with cone radius A R = 
a J A rj 2 + A <p 2 = 0.4 [41], We measure the transverse energy Et = Esin 9, where 9 is the 
polar angle of the centroid of the cluster’s towers, calculated using the measured z position of 
the event vertex. The total energy E is the sum of the energy deposited in calorimeter towers 
within the cone. Jets are identified as isolated clusters that contain significant hadronic 
energy. Jet measurements make the largest contribution to the resolution of the top quark 
mass reconstruction due to their relatively poor energy resolution, approximately (0.1 x Ex+ 
1.0) GeV [42], Additionally, the uncertainty arising from the jet energy scale is the dominant 
source of systematic uncertainty for the top quark mass. In contrast, we assume the angles 
of the quarks are well measured from the jet angles. They are therefore directly used in the 
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mass reconstruction without correction. We briefly describe the jet energy corrections and 
their systematic uncertainties in this section. More details on the CDF jet energy response 
are available elsewhere [43]. 

1. Jet Corrections 

To be used for top quark mass reconstruction, measured jet energies are first corrected 
with a set of “flavor-independent” or “generic” corrections, so called because they are ex¬ 
tracted mainly from dijet and minimum bias samples. These corrections are made in sev¬ 
eral steps. A first correction scales the forward calorimeters to the central calorimeter 
(0.2 < |? 7 1 < 0.6) scale for data and Monte Carlo separately. A dijet balancing procedure 
is used based on the equality of the transverse energies of the two jets in a 2 —> 2 process. 
The correction is obtained as a function of rj and the transverse momentum, pt of the jet. 
The relative correction ranges from about —10% to +15%. The corrections are tested by 
comparing Et balance in 7 +jet events in data and Monte Carlo simulation. As shown in 
Fig. 2, after corrections the response of the calorimeter is almost flat with respect to p for 
both data and Monte Carlo simulations. A second correction is for multiple pp interactions 
due to high-luminosity operation of the Tevatron. The energy from additional pp interac¬ 
tions during the same accelerator bunch crossing can fall inside a jet cluster, increasing the 
energy of the measured jet. The correction associated with this effect is derived from min¬ 
imum bias data and is parameterized as a function of the number of identified interaction 
vertices in the event. This effect is corrected on average and is very small (less than 1%). 
A third correction is called the absolute energy correction. This correction is applied to 
account for calorimeter non-linearity and is based on the response of the calorimeter to in¬ 
dividual hadrons as measured by E/p of single tracks in the data. After this, the jet energy 
corresponds to the energy of the hadrons incident on the jet cone. The absolute correction 
varies between +10% and +40%, depending on the jet pr as shown in Fig. 3. The accu¬ 
racy of this correction depends on the Monte Carlo correctly modeling jet fragmentation 
into hadrons, for example the charged to neutral particle ratio, and the particle multiplicity 
and pr spectrum. This has been checked by comparing the jet charged particle multiplicity 
distributions in data and Monte Carlo. 

After these generic jet energy corrections, we use the transfer functions described in 
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Section VIII to account for the fraction of the quark energy deposited outside the jet cone 
as well as differences between light quark jets from W boson decay and the b jets coming 
directly from the top quark decay. Since the transfer functions are evaluated as a function 
of Et, the resulting top quark mass is insensitive to the difference in the E r distributions 
of the dijet and top quark events. 



FIG. 2: px balance, p^/p^ — 1, in 7 -jet events as a function of jet ?? after relative corrections are 
applied. Circles are data and HERWIG Monte Carlo simulation is plotted as triangles. 


2. Systematics Uncertainty on the Jet Energy Scale 

The systematic uncertainty on the jet energy scale comes from a number of sources. The 
uncertainty in the calorimeter response relative to the central calorimeter (relative response) 
is determined by varying the dijet event selection criteria and the fitting procedure. This 
uncertainty is typically between 0.5% and 1.0% for most jets used in the top quark mass 
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FIG. 3: The size of the absolute (hadron level) correction, CAbs> as a function of px of the jet for 
cone size 0.4. 

analysis. A second systematic uncertainty comes from the hadron jet modeling used in the 
absolute energy scale correction. The main sources here are uncertainties in the calorimeter 
response to single hadrons (E/p) and jet fragmentation (charged to neutral particle ratio). 
Smaller contributions come from the Monte Carlo modeling of the calorimeter response close 
to tower boundaries in azimuth, and from the stability of the calorimeter calibration with 
time. In total, this uncertainty ranges from 1.5% to 3.0%, depending on jet px- 

A third systematic uncertainty arises from modeling the energy that is deposited out¬ 
side the jet cone (out-of-cone correction). This uncertainty, which ranges from 2% to 6% 
depending on jet px-, is determined from the difference between data and Monte Carlo in 
7 +jet events. The jet correction systematic uncertainties from two other sources, the extra 
energy from multiple pp collisions and the underlying event, the energy associated with the 
spectator partons in a hard collision event, are negligible for this analysis. 
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In summary, the systematic uncertainties on jet energy measurements for jets in the 
central calorimeter (0.2 < q < 0.6) are shown in Fig. 4. The black line corresponds to 
the total uncertainty, obtained by adding in quadrature all the sources described above. 
Typically, it is 3% to 4% for jets with p T > 40 GeV/c. In order to check the energy 
corrections and systematic uncertainties, 7 -jet events are used since the jet px range in this 
sample is similar to that in tt events. Figure 5 shows the difference between 7 -jet balancing, 
defined as — 1, hi data and Monte Carlo after all jet corrections are applied. The ± 

lcr range adequately covers the spread in these data points. These jet energy uncertainties 
are propagated to the top quark mass measurement as described in Section XII. Additional 
process-specific uncertainties are also considered in that section. The most important of 
these for the top quark mass measurement is the 6 -jet energy scale. 



50 100 150 200 250 300 350 400 450 500 

p“ rr (GeV/c) 


FIG. 4: The systematic uncertainties as a function of the corrected jet px in the central calorimeter 
( 0.2 < \q\ < 0 . 6 ). 
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FIG. 5: The fractional difference between the jet and photon transverse momenta in 7 -jet events 

are calculated after all jet corrections are applied. Plotted here is the difference between this 

quantity in data and simulation as a function of photon px, for different r/ ranges. The dashed 
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lines show Tier from the jet energy systematics. 










































C. 6-Jet Tagging using Secondary Vertex Identification 


The identification of 6 jets from top quark decay plays an important role in this analysis. 
Since most of the selected kF+jets events coming from non -tt processes do not contain 
bottom or charm quarks in the final state, requiring the presence of b jets provides significant 
background reduction. 

The SECVTX silicon vertex 6-jet tagging algorithm searches within a jet in the central 
region for a displaced secondary vertex due to the decay of a B hadron [44, 45]. It uses 
tracks that are within A R < 0.4 of the jet axis and have hits in the silicon detector. A set 
of cuts involving the transverse momentum, the number of hits, and the X 2 /ndf of the track 
fit are imposed to select good quality tracks in a jet. Then the algorithm is performed as 
follows: (1) Find at least three good tracks with px > 0.5 GeV/c and an impact parameter 
significance do/cpio | > 2, where d 0 is the impact parameter of the track relative to the 
accelerator beamline (measured on average for each store of pp collisions) and is the 
uncertainty coming from both the track and beamline positions. At least one of the tracks 
must have px > 1 GeV/c. (2) Reconstruct a secondary vertex using the selected tracks. 
(3) Calculate the two-dimensional decay length of the secondary vertex ( L 2 d ) from the 
primary vertex. (4) Require L 2 d/o'l 2D > 7.5, where <tl 2D is the estimated uncertainty on 
L- 2 d, typically 190 /mi, to reduce the background from false secondary vertices (mistags). If 
a secondary vertex is not found, a second pass of the algorithm is carried out with tighter 
track requirements, demanding at least two tracks with px > 1 GeV/c and |do/°A>l > 3-5, 
including at least one track with p T > 1.5 GeV/c. The cut on L 2D /<7l 2D is the same as in 
the first pass. 

Based on simulation of the 6-tagging algorithm, requiring at least one 6-tagged jet keeps 
60% of top quark events while removing more than 90% of background events. The difference 
between the efficiency in the simulation and that in the data is measured using a 6-enriched 
dijet sample in which a non-isolated electron is found in one jet. We find a data to Monte 
Carlo tagging efficiency scale factor of 0.91 ± 0.06 [45], which is used with the Monte Carlo 
in estimating the expected background (see Section V). The uncertainty includes both 
systematic and statistical contributions. The main cause of the scale factor being less than 
1.0 is the difference in track resolution between the data and Monte Carlo simulation. 
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D. Missing Transverse Energy: fix 


The presence of neutrinos is inferred from transverse energy imbalance in the detector. 
The missing transverse energy is calculated as 

fir = - ^2 Et'i^ii (1) 

i 

where E l r is the magnitude of the transverse energy contained in calorimeter tower i, and 
fti is the unit vector from the interaction vertex to the tower in the plane transverse to 
the beam direction. If isolated high-p^ muon candidates are found in the event, the fix is 
corrected by subtracting the energy deposited by the muon in the calorimeter, and adding 
the muon px to the vector sum. The typical fix resolution in tt Monte Carlo events is 
approximately 20 GeV. Further corrections to fix related to jet energy corrections and the 
transfer functions are described in Section VIII. 

E. Event Selection 

The final state of the tt lepton+jets mode contains a high-momentum lepton candidate, 
missing transverse energy that indicates the presence of a neutrino from W leptonic decay, 
and four hadronic jets, of which two jets are expected to be b quarks. We summarize the 
selection criteria below: 

Exactly one isolated electron (muon) candidate is required, having E T > 20 GeV {px > 
20 GeV/c) and \rj\ < 1.0. Any event with two leptons satisfying the lepton criteria (see 
Section IV A) is removed. We also remove events where the second lepton candidate is an 
electron in the plug calorimeter or a muon that fails the CMUP requirement but has one 
CMU or CMP muon segment, to remove top dilepton events (tt —» l + vl~vbb). The missing 
transverse energy, fixi is required to be greater than 20 GeV. Events with Z boson candidates 
are removed by requiring that there be no second object that forms an invariant mass with 
the primary lepton candidate within the window 76-106 GeV/c 2 . Here, the second object 
is an oppositely-signed isolated track with px > 10 GeV/c for primary muons; for primary 
electrons it may be a track, an electromagnetic cluster, or a jet with Ex > 15 GeV and 
\r)\ < 2.0 that has fewer than 3 tracks and a high electromagnetic energy fraction. The 
primary vertex of the event must have its z coordinate within 60 cm of the center of the 
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CDF II detector. The jets are clustered after removing towers within electron clusters and 
correcting each tower Et for the location of the primary vertex z coordinate. We select 
the events that have exactly four jets with Et > 15 GeV and \r)\ < 2.0 to better match 
the leading order matrix element that is used in this analysis. This helps to reduce the 
contamination by initial and final state radiation by 10% compared to events with four 
or more jets. Finally, at least one SECVTX tagged b jet is required. The above selection 
yields 63 6-tagged events in which 39 events contain an electron and 24 a muon. Of these 63, 
sixteen double 6-tagged events are observed. The overall selection efficiency for these criteria 
including the branching ratio, estimated from tt Monte Carlo simulation, is approximately 
1.94 ± 0.01% for the electron channel, 1.22 ± 0.01% for muons in the CMUP, and 0.41 ± 
0.01% for muons in the CMX. 

V. BACKGROUND ESTIMATE 

It is important to this analysis that we have an accurate estimate of the background level 
in the final event sample, because to extract the top quark mass, we shift the measured 
mass to correct for background contamination (see Section IX B). We use the technique 
employed in the tt production cross section measurements described in [44, 45]. The major 
backgrounds come from misidentification, for example a fake lepton and large missing Et in 
events not containing a W boson, and events in which a light quark or gluon jet is mistagged 
as a b jet. The major physics background is the production of a IF boson along with heavy 
flavor quarks. 

A. Non-IF (QCD) Background 

The non-IF background (QCD multijets), events that do not contain a IF boson, is 
estimated directly from the data, separately for electrons and muons. These events include 
fake leptons and missing energy as well as semi-leptonic B decays. An isolated primary 
lepton and large fir due to the neutrino are characteristics of real IF events, not shared 
by most non-lF events. To estimate the number of non-lF events in the sample, we use a 
2 -dimensional plot of $t vs lepton isolation, defining four regions: 

A: isolation > 0.2 and $t < 15 GeV, 
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B: isolation <0.1 and Ifix < 15 GeV, 


C: isolation > 0.2 and $x > 20 GeV, 

D: isolation <0.1 and tyx > 20 GeV, 

as shown in Fig. 6. Region D contains the real W events. For the non-IF background, these 
two variables are assumed to be uncorrelated; therefore Nb/Na, the ratio of the numbers of 
low events at low and high isolation, is the same as N^/Nq, the ratio at high Tfix- The 
amount of non-IF contamination in region D is then calculated as N& (non- W) = Nb ^ c ■ 
Since backgrounds from W + heavy flavor (Wbb, Wcc, and Wc) and W+ a mistagged jet 



Missing E T (GeV) 


FIG. 6: Plot of missing Ex vs lepton isolation for events that contain an electron candidate 
(without an isolation cut) and two or more jets with Ex > 15 GeV and |? 7 | < 2.0 before the b 
tagging requirement is applied. 

are estimated by normalizing to the number of the “pretagged” events, those found prior to 
applying the b tagging algorithm, (see Section V C), the contributions of non-IF background 
to both the pretagged and the tagged samples have to be measured, even though we use only 
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the tagged sample estimate directly in the mass measurement. To evaluate the expected 
number of events in the tagged sample, we use two methods. One is to estimate Nx> directly 
from the tagged sample. However, this is limited by low statistics, hence we lower the 
isolation boundary from 0.2 to 0.1 for regions A and C. A second method is to scale the 
estimate in the pretagged sample by the tagging rate for non -W events. The number of 
events in region B with 2 or more jets is used to obtain a reliable tag rate [45]. These two 
estimates are found to be consistent within the statistical uncertainty. The final estimate is 
obtained from the weighted average of the two methods. 

B. Mistags 

A SECVTX tag in a jet without a heavy flavor quark is called a “mistag”. The mistag 
rate per jet is measured using a large inclusive-jet data sample, without relying on the 
detector simulation. It is parameterized as a function of the number of tracks in the jet, the 
jet Et before energy corrections, the rj and (j) of the jet, and the sum of the Ex's of all jets in 
the event with Ex > 10 GeV and \i]\ <2.4. To estimate the size of the mistag background, 
each jet in the pretag sample is weighted by its mistag rate, and then the sum of the weights 
over all jets in the sample is computed, after correcting for the fraction of pretagged events 
that are due to non-IT background (~ 10 % for electron and ~ 5 % for muon channel) to 
avoid double counting these two background sources. Using the number of mistagged jets 
as the number of mistagged events is a good approximation because the mistag rate per jet 
is sufficiently low, typically 1%. This method is tested using samples of pure mistagged jets 
in which the jet and reconstructed secondary vertex are on opposite sides of the primary 
vertex. We find good agreement between the predicted and observed numbers of jets in the 
pretagged sample as a function of jet E T [45]. 

C. W + Heavy Flavor (fT+HF) Backgrounds 

The production of IT bosons accompanied by QCD production of heavy flavor quarks 
in the processes Wbb, Wcc, and ITc produces a signature very similar to tt events in the 
final state, and is a significant part of the background for the tagged sample. These contri¬ 
butions, Nhf, are evaluated by Nhf = N wetag x Jhf x tbt.ag, where N pretag is the number 
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of pretagged events in the data, Jhf is the fraction of pretagged events containing Wbb, 
Wcc, and Wc, estimated using the Monte Carlo models ALPGEN+HERWIG, and eu a g is 
the b tagging efficiency of each background source. The heavy flavor fractions are found to 
be approximately 2-3% for Wbb and Wcc, and 6% for Wc events, and were calibrated by 
comparing dijet Monte Carlo events with data. Details of these calculations can be found 
in [45]. 


D. Other Backgrounds 

The WW, WZ, and ZZ background, Z —> rr, and electroweak single top production 
by both s-channel qq fusion and /-ch ann el bb-gluon fusion processes are evaluated based 
on predictions from Monte Carlo simulation by multiplying the acceptances for these pro¬ 
cesses, as determined from the PYTHIA Monte Carlo program, by their production cross 
sections [46, 47] and the integrated luminosity for the data sample. The Monte Carlo accep¬ 
tance is corrected for the differences between Monte Carlo and data for lepton identification 
and trigger efficiencies. The 6-tagging efficiency is also scaled by the MC/data tagging scale 
factor which was described in Section IV C. 

E. Background Summary 

Events having a lcptonic W decay plus 1 or 2 jets are used to test the background 
estimation procedure. We End agreement between the data and Monte Carlo predictions 
within their uncertainties. The results provide confidence that we can estimate the number 
of background events in the four jet topology. The background contributions to the W+4jets 
sample are summarized in Table I. We estimate the total number of background events to 
be 9.2 ± 1.8. The expected number of signal events for the predicted tt cross section ranges 
from 46 ± 5 events for M top = 170 GeV/c 2 (7.8 pb) to 37 ± 4 events for M top = 178 GeV/c 2 
(6.1 pb). The relative uncertainty on each cross section value is roughly 10%, mainly coming 
from the parton distribution functions [48]. However, the estimate of 9.2 background events 
has been extracted with little dependence on the theoretical prediction of the tt cross section. 
We find that a 5 GeV/c 2 difference in M top (corresponding to about a 1.0 pb difference in tt 
cross section) alters the background estimate by roughly 1%, corresponding to a negligible 
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TABLE I: The expected number of background events from individual sources and the fractions 
with respect to the 63 observed events. 


Source 

Number of Events 

Fraction (%) 

Non-W (QCD) 

3.07 

± 

1.06 

4.87 

Mistag 

2.27 

± 

0.45 

3.60 

Wbb 

1.70 

± 

0.79 

2.70 

Wcc 

0.81 

± 

0.40 

1.28 

Wc 

0.51 

± 

0.23 

0.81 

ww/wz/zz 

0.39 

± 

0.08 

0.62 

Single Top 

0.41 

± 

0.09 

0.65 

Background Total 

9.2 ± 1.8 

14.5 

Observed events 


63 


100 


~ 0.1 event. Therefore in this analysis, 53.8 events are assumed to be from signal tt events 
(9.2 background events subtracted from the observed 63 events). This is supported by the 
16 double b tagged events in the data, where the expected number of events estimated by 
scaling the 63 observed events is 16.8 ± 1.8 events, including an expected 1.4 background 
events. For a kinematic comparison, the Ht distribution is shown in Fig. 7. H? is defined 
as the scalar sum of the lepton Et , the and the Et s of the leading four jets. We find 
good agreement between the data and the Monte Carlo for both the double b tag ratio and 
the kinematic distribution. 

VI. ANALYSIS OVERVIEW 

The analysis proceeds as follows. For each event, a likelihood as a function of top quark 
mass is calculated by the dynamical likelihood method (DLM), described in Section VII. 
The DLM defines a likelihood for each event based on the differential cross section per unit 
phase space of the final partons in the elementary process. It does not however use the 
number of observed events to constrain M top based on the theoretical ttbar cross section. 
To infer the parton momenta, we employ transfer functions that relate the observed jet 
energies to the corresponding parton energies: four jets to four quarks ( qq' from the W, b 
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FIG. 7: Ht distribution for signal ( M top = 178 GeV/c 2 ) and background, normalized to 53.8 and 
9.2 events, respectively. The total 63 data events are shown as the triangles. 

and b ). The transfer functions are obtained from ffERWIG Monte Carlo tt signal samples. 
Section VIII describes the details and performance checks of the transfer functions. There 
are 6 or 2 possible assignments of the four jets to the individual partons, depending on 
whether 1 or 2 jets are 6-tagged, and for each assignment 2 solutions for the ^-component 
of the neutrino momentum. Instead of selecting one particular assignment (e.g., the one 
giving the maximum likelihood), we average the likelihoods for all possible jet assignments 
and neutrino solutions in an event, and such event likelihoods are multiplied together to 
obtain the joint likelihood function for the entire data sample. We take the average rather 
than the sum in order not to give greater weight to single 6-tag events with their larger 
number of jet assignments. After calculating the top quark mass under the assumption that 
all events are tt , the effect of the background is corrected by using a mapping function that 
provides a mass-dependent correction factor. The mapping functions are extracted using 
Monte Carlo pseudo-experiments in which the numbers of signal and background events are 
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Poisson distributed around the expected means. The mean number of signal events is not 
changed for different top quark mass samples. Finally, we extract the measured value of the 
top quark mass using the expected background fraction estimated in Section V. 

VII. DYNAMICAL LIKELIHOOD METHOD 

The DLM was originally proposed in 1988 [10] and developed in [11, 12]; details of the 
latest formulation are described in [13]. In DLM, we generate the parton kinematics from 
the observed quantities, and the likelihood of the reconstructed parton state is defined by 
the differential cross section per unit phase space of the final partons in the elementary 
process. 

A. Definition of the likelihood 

1. Differential cross section 

The elementary parton process in a pp collision can be written as, 

n 

ai/p + a 2 /p^> -> C, C = ^Cj. (2) 

i= 1 

where (i\ and a 2 are the initial partons - quark, anti-quark, or gluon - in the proton and 
anti-proton, respectively, and ci, c 2 , ■ ■ ■, c n are final state partons and leptons. These are 
defined after initial-state radiation but before final-state radiation. In the case of the tt 
lepton+jets channel, the initial parton set (ai,a 2 ) is (q,q), (q,q) or (g,g), and the final 
leptons and partons are l, is, q, q', b, b or their anti-particles, where (q, q) are quarks from W 
decay, and l — e or g. Throughout this paper, a particle 4-momentum and its 3-momentum 
are represented by a small letter in italics and in bold, respectively: e.g., a symbol “p” 
represents the proton’s 4-momentum, and p its 3-momentum. The final partons are assumed 
to have their pole masses (4.8 GeV/c 2 for the b jets and 0.5 GeV/c 2 for the W daughter 
jets), so that their 3-momenta define their states unambiguously. 

The hadronic cross-section for process (2) is given by 

da = dz ai dz a2 d 2 p T f ai /p(z ai ) fa 2 /p{ z a2 ) fr {pt) 

xda(ai + a 2 —> C\ a), (3) 



where da is the parton level cross section [49], 


da(ai + 02 * C] ex) — — 9 9 l-^( a i ~h <i2 C'i °<-)\ 2 d^^( a i + a^C). ( 4 ) 

(°i • a 2 ) 2 ~ m 2 ai m 2 a2 

In Eq. (3), the symbol ct represents a set of dynamical constants to be measured, e.g., masses, 
decay widths and coupling constant ratios. In this analysis, ct is simply the top quark mass 
M top . The variables z ai and z a2 are the energy fractions of cp and a 2 in hadrons p and p 
respectively, and m ai and m a2 are their masses that are assumed to be zero in this analysis. 
Px is the total transverse momentum of the initial and final systems in the plane transverse 
to the beam axis. Functions f ai / P (z ai ) and fa 2 /p{ z a 2 ) denote the parton distribution functions 
(PDF’s), while fr{jPr) is the probability density function for the total transverse momentum 
of the system acquired by initial state radiation. In this analysis, we use the leading order 
PDF, CTEQ5L [34], Other PDF sets are used to calculate the systematic uncertainty. The 
function /t(pt) is obtained by running the PYTHIA generator. 

In Eq. (4), Ad is the matrix element of the process that is being studied (in this case, tt 
production and decay described in Section VIIB), and d&n is the Lorentz invariant phase 
space element, 


d ^ f) n (2 . m - ^ 

We use Eq. (4) to formulate the parton level likelihood. The basic postulate is that 
final partons occupy an n-dimensional unit phase space volume in the neighborhood of 
c = (ci,.. ., c n ). When a momentum set c is given, the total probability for this final state 
to occur is obtained by integrating Eq. (3) over initial state variables z ai , z a2 and pr, as 

da 




(/) 


= I(ai , a- 2 ) | M (ai + a 2 —»■ C] ct) 


where 


J(cq, a 2 ) — 


(2tt) 


jfai/p( z ai)fa 2 /p( z a 2 ) f t(.PT ) 


( 6 ) 


( 7 ) 


4a/( cp • a 2 ) 2 

is the integration factor for the initial state. Because of the 5-function in Eq. (4), the initial 
parton momenta (i\ and a 2 are uniquely defined by that of C. 

For a given set of c = (ci,..., c n ), we define the parton level likelihood for ct by 


r (p) / | \ / da 

L{’(ol\c) = /q 




(/)’ 


( 8 ) 
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where / 0 is given by 


u e(M 0 )a T (M 0 ) 

In Eq. (9), ctt(M 0 ) and e(M 0 ) are the total cross section and the detection efficiency for 
the true (pole) top quark mass of the sample, respectively. Thus l 0 is the integrated lumi¬ 
nosity per event in the sample. This method does not make use of any constraint from the 
theoretical tt cross section as a function of M top . Since l 0 only depends on the true (pole) 
top mass, it does not vary event by event in the sample [13] and only changes the absolute 
value of the likelihood, i.e., it has no effect on the final result. In this sense, this is not a 
real likelihood and any bias has to be corrected by the mapping function. The statistical 
uncertainty is also corrected by checking the pull distribution as described in Section IX. 


2. Propagator factors 


When process (2) includes internal lines of the Feynman graph, for example r in 

ai/p + a 2 /p —> r + c j+1 H-b c n , (10) 

r —> ci H-b Cj, (11) 

we have to consider the propagator factor for a particle r. We treat, in this channel, t, t, 
W + and IE - as internal lines (r) as illustrated in Fig. 1. 

We factorize the matrix element as 

| M( ai + a 2 - C; a)\ 2 = \M wod \ 2 V{s r )\M dec \ 2 , (12) 


where M. prod and A4 de c are the matrix elements for the production process and decay re¬ 
spectively, and s r is the virtual mass squared of r, which satisfies 

* = £»’• (13) 

i =1 

For the propagator factor V(s r ), we assume the Breit-Wigner form, 


V{s r ) = 


(14) 


(s r - M r 2 ) 2 + M 2 r 2 ' 

In the reconstruction of v z , the unmeasured ^-component of the neutrino momentum, we 
generate the W mass squared sw according to II(svk), where 


n(s) = V(s)I I V(s)ds, 


(15) 
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and solve Eq. (13) for v z (quadratically ambiguous). The function II(s) satisfies 


n(s)ds = 1. 


( 16 ) 


3. Transfer functions for observables 

Final quarks and gluons are not directly observed; they undergo hadronization, are ob¬ 
served by detectors with finite resolution, and are reconstructed as jets. Jet energies are 
generally calibrated using generic QCD jets, so we need additional corrections for b jets 
and W daughter jets in the tt processes. To describe the relation between the parton and 
observed quantities (observables), we introduce the transfer function (TF) w(y\x), where y 
represents a set of observables and a; is a parton variable set that corresponds to y. In the 
Z+jets process, y consists of the momenta of the e or y and of the 4 jets, and the missing 
transverse energy (fir)- In the present analysis, we use the TF only for quarks and jets. 
Electrons and muons are measured well in the detector, and fir is calculated from other 
observed quantities in an event (see Section VIIIB). 

The differential probability for the parton variables x to be observed as y, dP(y,x), is 
defined by the TF w(y\x) as 

dP{y;x) = w(y\x)dy. (IT) 

The TF for a single quark, w(y\x), is obtained from the (x,y) distribution of the tt Monte 
Carlo events. The event selection criteria are applied to these events. The effect of the 
detection efficiency for the variable set (x, y ) is thus included in the determination of w(y\x), 
and the normalization condition, 

Jw(y\x)dy=l, (18) 

holds. 

4- Likelihood for a single path, a single event and multiple events 

Single path reconstruction and its likelihood The single path likelihood is defined for 
each complete set of parton kinematics and calculated as follows: 

(1) We assume that the momentum of the e or y is precisely measured. 
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( 2 ) The four jets are assigned to the four final state quarks. We call such an assignment 

a “topology” denoted by I t {h — 1, • • • , N t ). Therefore N t represents 6 or 2 possible 
topologies in an event, depending on whether 1 or 2 jets are 6-tagged. 

( 3 ) Once a topology is specified, we randomly generate the parton kinematics (6, 6, q, q') 

according to the transfer functions. We identify the momentum direction of each 
jet with that of the assigned quark, and transfer variables x {Exb, E t i , Et q , Etq■>) are 
chosen using as input y , the transverse energies of the corresponding jets. More details 
are given in Section VIII. Each random generation is denoted by k. 

( 4 ) After ( 1 ), ( 2 ) and ( 3 ), the transverse momentum of the neutrino (v x ,is y ) is identified 

with the measured value of and then corrected using both jet corrections and 
(fc-th) jet transfer functions. Details of this correction are discussed in Section VIII. 
Then the parton momenta are defined except for u z , the unmeasured z component of 
the neutrino momentum. To get v z we choose Sw according to II(sw) in Eq. (15), 
and v z is obtained by solving Eq. (13). A quadratic ambiguity results in two solutions 
(z/ 2 i, v z2 ) that are specified by an integer I s (=1 or 2), which is treated separately from 
“topology” as defined in (2). 

( 5 ) From procedures ( 1 ), ( 2 ), ( 3 ) and ( 4 ), an event configuration (I t and I s ) and parton 

momenta (k for a generation by the transfer functions) are uniquely specified. The 
likelihood of a single path is then 

L[ k \l t ,I s ,x k ]Mtop\y (l) ) = I»,k,i\M top ), (19) 

d&Q 

where i is the event number, and dstyf is the phase space for (l,v,b,b,q,q'). In this 
context, when we use “a single path” the likelihood (the differential cross section) can 
be calculated without any ambiguity, since all information such as assignments and 
parton momenta are determined. Then for each path, we make a parameter scan of 
M top uniformly in its search region (typically 155-195 GeV/c 2 ). 

Likelihood for a single event All possible paths (configurations), each labeled by k, I t 
and I s , are mutually exclusive, and we define the likelihood of the i-th event as the average 
of the likelihoods for all paths, 
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j K Nt 2 

HM top \y«>) = -2- Y.Y.Y. £ ‘‘V«. 1« *>■< M ‘°r\y {l> )- (20) 

t k= 1 It = 1 / s = l 

This definition of the event likelihood thus contains the correct set of (J f , I s ) (if the event 
is tt —> Z+jets). The sum over k corresponds to the numerical integration of the parton 
kinematics according to the transfer functions. Therefore we repeat the procedure ( 2 )-( 5 ) 
a large enough number of times (K) so that the value of L(M top \y^) converges, which is 
typically 50,000 times. In summary, each time a parton configuration and set of momenta 
are selected (I t , I s and k), we calculate the likelihood (single path) and in order to obtain 
the event likelihood, we average all possible single path likelihoods by numerical integration. 

Likelihood for multiple events The single event likelihood is a function of M top . For 
multiple events, we get mutually independent functions of M top . Hence to obtain the top 
quark mass from a total of N ev events, we form the product of all the event likelihoods, and 
take negative two times the logarithm of this product, 

/N ev \ 

A {Mtop) = -2 hr (J] L(M top \yV)j . (21) 

Then we obtain the top quark mass as the maximum likelihood estimate of M top , 

M top = M top at the minimum of A (M top ), (22) 

and its uncertainty from the points where A A = 1. 

B. Matrix Element Calculation in the lepton+jets channel 

The matrix element squared \M.f is factorized into 3 parts: (1) tt production (|Al t f| 2 ), 
(2) the propagators of the top and anti-top [V t i and Vth), and (3) the decay matrices, |«M«| J 
and \M t h\ 2 , for lcptonic and hadronic top decays, respectively. Namely, 

\M \ 2 = \Mtf\ 2 VtiVt h \Mti\ 2 \M th \ 2 . (23) 

The production matrix element for the qq initial state at leading order [50]-[52] is 

I Maim - tt) | 2 = -]p(2 — /3 2 sin 2 $*), (24) 
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where 9* is the angle between the top quark and the incident quark in the proton in the 
tt center of mass system, (3 is the velocity of the top quark and g s is the strong coupling 
constant. 

For the gg initial state [50]—[52], the matrix element can be expressed as 


I M t t(gg -> tt)\- = g*( 


6tiT 2 8 


Wi + rl + p 


P~ 


4tiT 2 


where 


2 (gi-t) 2 (c/ 2 • t) 4 Ml p „ , , , 2 

n = — 7 —,t 2 = —7 —,p = ——,s = {gi + g 2 ) , 


(25) 


(26) 


s s s 

gi and c/ 2 are the incident gluon momenta in the proton and anti-proton, and M top is a free 
parameter for the top quark mass. In these equations, the tt spin correlations have been 
ignored. This effect is included in the mapping functions described in Section IX B since 
spin correlations are included in the HERWIG Monte Carlo samples that are used to make 
the mapping functions. Since we don’t know what the initial state was, the likelihoods for 
the two processes (qq and gg) are summed up in the event likelihood with the appropriate 
PDF weights obtained from CTEQ5L. 

The propagators for the top and anti-top quarks are as specified by Eq. (14) in which M r 
corresponds to M top and s r is the invariant masses of the leptonically (tl) or hadronically 
decaying top quark (th). 

The decay matrix elements for the lcptonic and hadronic channels are given by 

(t-l)(b-v) 


\M tl \ Z = 4 g* 


{Siu - MlY + MlV 


2 i 
W 


\Mth\ 2 = 


(t-qi){b-qj) 


2 tfj (^ - M w) 2 + M w?V 


(27) 

(28) 


where Si u and represent the invariant masses squared of the lepton+neutrino and the 
two quarks from the W respectively. For the mass and decay width of the W, we assume 
the world average values, Mw = 80.4 GeV/c 2 and Yw = 2.1 GeV/c 2 . In Eq. (27), the dot 
product of b and v can be calculated because the ^-component of the neutrino momentum, 
u z , has already been determined in step (4) above. In Eq. (28), we make both possible 
assignments of the two jets to q and q' from the W, and the likelihoods corresponding to 
the two possibilities are averaged. 
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VIII. TRANSFER FUNCTIONS (TF) 


As described in the preceding section, the transfer functions deal with the relation between 
parton and corrected jet energies. This allows us to use the full distribution, including tails, 
of the fraction of quark energy deposited outside of the jet cone. Also, since the generic 
jet corrections are based on the QCD dijet process, the transfer functions can correct for 
ft-specific b jets and W daughter jets. 


A. Definition and Performance 


The transfer variable set ( x , y) we use in this analysis is the transverse energy of a parton 
(quark) and the corresponding jet, 

x = Et (parton), y = At (jet), (29) 


where At (jet) has been corrected with the CDF generic corrections described in Sec¬ 
tion IV B 1. TF’s are obtained for b jets and W daughter jets separately and are applied 
only to the four highest Et jets in an event, which are assumed to come from the t and t 
decay. 

The TF’s are obtained with the following procedure. We generate events with the HER- 
WIG and PYTHIA Monte Carlo event generators and a full detector simulation, and select 
events with the same criteria as applied to real data. From the accepted events, we se¬ 
lect those jets that are within a distance A R < 0.4 from a final state quark. Using these 
“matched” jets, we obtain a 2-dimensional density function of the number of events at ( x , y), 
D(x, y : M top ). 


The number of events in a dx dy bin is given by 

da 


D(x,y : M top )dxdy = L int 


dx 


dx X w(y\x\\M top )dy 


(30) 


where L int is the integrated luminosity of the sample. The transfer function is obtained by 
removing the cross section factor from D(x,y : M top ), i.e., 


w(y\x\\M top ) 


1 

D(x, y . AAop)) 


(31) 


where 


Tix Li n t (^x f D{x, y . Altop^dy. 


(32) 
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Values of n x and w(y\x\\M top ) are numerically obtained from D(x,y : M top ) by Eqs. (32) 
and (31), respectively. 

The TF w(y\x\\M top ) depends on M top , most significantly through the event selection 
criteria. The total selection efficiency is about 2-3% depending on an input M top ; approxi¬ 
mately 4,000 events are accepted from the 200,000 events that is the typical size of the Monte 
Carlo sample for each top quark mass point. Thus the statistics of the Monte Carlo samples 
are not sufficient to obtain an M top -dependent TF. Therefore as an approximation, we use 
TF’s averaged over the M top search region (130-230 GeV/c 2 , sampled every 5 GeV/c 2 ), 

w(y\x) = (w(y\x\\ M top )) Mtop • (33) 

The transfer variables x and y are strongly correlated, so we make a variable transforma¬ 
tion from ( x , y ) to (£, Y) as 

£ = —, Y = y. (34) 

x 

The TF for variables (£, Y) is defined by 

/(£, Y) d£ dY = — D(x, y)dxdy. (35) 

n x 

In practice, /(£, y) is obtained by filling a (£, Y) histogram with weight 1 jn x for each 
Monte Carlo event. We call the variable £ a “response variable” in this paper. In the 
function /(£, V), £ and Y are much less correlated than x and y in w(y\x), so wider bins can 
be used in Y. In the reconstruction of parton kinematics, £ is generated from the observed 
value of Y(= y) according to /(£, V), and x is then determined by Eq. (34). 

An advantage of deriving the TF from Monte Carlo events is that the effect of the 
detection efficiency and acceptance is automatically included in the determination of the 
TF. 

As illustrated in Fig. 8, the TF’s strongly depend on the Et and slightly depend on 
pseudorapidity rj of the jets. Therefore we calculate TF’s in 10 bins of jet E T (15 to >105 
GeV in 10 GeV steps) and 3 bins of \r}\ (0.0-0.2-0.6-2.0) that correspond to different regions 
of the calorimeter [22], Thus separately for b and W jets, we make thirty histograms. In each 
bin, the mass averaged TF contains 5,000 jets on average, while if we use M top -dependent 
TF, it is about 250 which is not enough to get smooth functions. In the figure, the means of 
the response variable as a function of Et are compared with the transfer functions extracted 
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from only a single mass sample ( M top = 178 GeV/c 2 ). The b jet response is lower (higher) 
at lower (higher) Et for the mass averaged TF, while for W jets the response is almost 
identical. This is because the b jets, being the direct daughters of the top quarks, carry 
more of the mass information. By averaging over the samples in a wide mass range, the 
top quark mass dependence is reduced without needing the enormous statistics for making 
mass dependent TF’s. The response distributions are asymmetric due to the finite size of 
the clustering cone. Consequently we do not fit the distributions with a functional form, 
but rather generate random numbers to accurately sample the full distributions. 

To validate the transfer function performance using ti Monte Carlo samples with different 
masses, we investigate the invariant mass of the jet pair from the W and the three jets from 
the hadronically decaying top quark using the following procedure. 

1. Jet-parton matching 

To ensure proper assignment of jets to partons, we require the distance (AR) between 
a jet’s direction and a parton’s direction to be less than 0.4. Moreover, if two or more 
jets are within AR<0.4 of a parton direction, we discard the event. 

2. Applying the transfer function 

This is performed by random generation of the response variable £ from the given 
Y = y. Explicitly, the transverse energy of the parton is obtained by 

E t (parton) = ^ (36) 

Then the dijet ( W ) and trijet (top) invariant masses are calculated. The random 
number generation is repeated more than 50,000 times (we call this “scanning”). After 
scanning, distributions of the dijet and trijet invariant masses are obtained for each 
event. 

3. Extracting the invariant mass 

We calculate the mean of the distribution obtained in step 2 by fitting the distribution 
from each event with a Gaussian function and storing the fitted mean value in a 
histogram. 

The invariant masses of the dijets and trijets before and after applying the transfer 
function are shown in Fig. 9. Since the out-of-cone correction is not applied to the masses 
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FIG. 8: The distributions of the response variable £ for b jets (upper-left) and W jets (upper-right) 
for different ranges of jet r]. Each distribution is normalized to unit area. The jet Et dependence 
is shown in the lower-left and lower-right for b and W jets, respectively, by plotting the mean of 
the response variables as a function of jet Et- 

before the transfer function, (we start with hadrons within the jet cone and apply the transfer 
function to obtain the parton energy), lower masses are observed, while after the transfer 
function is applied, the final values of the mean agree with the generated input masses. The 
left plots in Fig. 10 show the r] dependence of the invariant masses, while the right plots 
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show the pt dependence. There is a large px dependence in the plots before the transfer 
functions are applied. The transfer functions, however, largely eliminate this dependence. 
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FIG. 9: Comparisons of reconstructed invariant masses of W dijets (left) and top trijets (right) 
in HERWIG Monte Carlo samples, as a function of input top quark mass, before and after the 
transfer function is applied. Dashed lines correspond to the input masses of the W and top quark. 

We investigate the Monte Carlo generator dependence by comparing PYTHIA and HER¬ 
WIG, which have different fragmentation modeling, but no significant discrepancies are 
observed. Since the top quark mass samples are produced with HERWIG, we make the 
transfer functions with HERWIG and examine the generator bias in Section XII. We also 
check alternative variables that could be used in the response function: jet E, p, or px- 
As with the generator dependence, no differences are found in the shape or mean of the 
response functions or in the reconstructed invariant masses of W dijets or top trijets. 

B. Missing Transverse Energy 

The “raw fir”, which is defined in Eq. (1), is corrected by applying generic jet energy 
corrections and then the transfer functions. First, the definition of missing transverse energy 
is rewritten using the observed objects in the sample to take into account the generic jet 


39 




8g : PYTHIA Input : 80.4 GeV/c 
-a- After TF 

oT' 86- 

r -a- Before TF 


o 

| 84t 
S 821 

i 8Oh = -&:0:# 

l 78 ^ 

.2. ™ - A - 

o 74 r A-A - 

72- 
7a 


Generator 

-o 


i 


“ 0 'r' ■ 

-e--a- : 8--o^@- -0-0--5-" 0 " 




V A_ " H Y 


■H-f 


0.5 1 1.5 2 2.5 

Pseudo Rapidity |rj| of W 


72 

70, 


88 :PYTHIA Input M^,: 80.4 GeV/c 
: -a- After TF 
*5 86 ■-*- Before TF 

% 84 ‘ 

^82) Q 

^ 7R' O' i -C0 

g 78 ' -4- -YY 

« 76 ■ _ A . ++ -A- 

b 74- , -a-" a "_ a _-a- -a- -a- 


Generator 

,-Q 


* 


-A- 


0 20 40 60 80100120140160180200 
p T Of W (GeV/c) 



Pseudo Rapidity |rj| of Top 


p T of Top (GeV/c) 


FIG. 10: rj and px dependence of dijet W (upper) and trijet top (lower) invariant masses, with 
the generated masses in PYTHIA Monte Carlo shown as open circles. Masses with only generic 
corrections are shown in open triangles, and the open squares show the results after TF application. 

corrections, 

4 

— = Ex (lepton) Y ^ ^ -E/^(jet) Y Xx (37) 

2—1 

where -E^(jet) is the Ex of the jet after generic corrections, and X T corresponds to all other 
calorimeter-deposited energies. (Within Xt, the generic corrections are also applied to all 


40 








jets with Et > 8 GeV and \rj\ < 2.4.) The above expression shows that the measurement 
is highly correlated with the jet energy measurements and corrections. Therefore, it is not 
considered to be an independent observable in this analysis. We calculate the transverse 
component of the neutrino momentum, v Tl from the leptonic W decay as 

4 

u T =fi T + Y(E J T (jet) - E j t { corr)) (38) 

3 =1 

where E J T ( corr) is the jet Et after generic and transfer function corrections are applied to 
each of the leading four jets. 

IX. TOP QUARK MASS RECONSTRUCTION 

This section describes how we extract the top quark mass and checks of the top quark 
mass reconstruction using Monte Carlo simulation. In this analysis, all events are assumed 
to be signal when the likelihood is calculated. The result is then corrected for the presence 
of background. Therefore, we first present the behavior of the background and its effects on 
signal reconstruction. Based on large sets of pseudo-experiments with varying background 
fractions, we derive the background correction function (“mapping function”) for the top 
quark mass parameterized as a function of the background fraction. At this point this 
method is fully calibrated with the Monte Carlo sample. 

A. Background Effect on the Likelihood 

As described in Table I in Section V, there are various background processes that may 
affect this measurement. We use the ALPGEN Monte Carlo with the CDF detector simu¬ 
lation to model mistags and W + heavy flavor events. The W + four light-flavor partons 
(W4p) process can be used to investigate mistags, since mistags come from a false secondary 
vertex, which is mainly due to track and vertex resolution effects. For non -W (QCD) back¬ 
ground, we use a non-isolated lepton sample (isolation / > 0.2, but fix > 20 GeV) from 
real data. Other electroweak processes, diboson and single top production, are modeled by 
PYTHIA Monte Carlo samples. All events are subject to the event selection described in 
Section IV. 


41 



The likelihood distribution and the mass-likelihood peak are expected to be changed by 
the existence of background events. To understand the background effects more fully, we 
first calculate the dynamical likelihood defined by Eq. (20) for each background sample, 
and the average joint maximum likelihood masses are estimated from pseudo-experiments 
with ~100-1000 events, depending on background source. Their values mainly result from 
the lepton ( Et > 20 GeV) and jet energy ( Et > 15 GeV) cut thresholds. The rnistag, 
W+ HF, and non -W samples produce almost the same maximum location in the range of 
155-160 GeV/c 2 , while the single top sample has 170 GeV/c 2 , a slightly higher mass. The 
diboson background has a slightly lower mass, around 155 GeV/c 2 , near the lower limit of 
the search region (155-195 GeV/c 2 ). For each background, the peak width of maximum 
likelihood masses per event is much larger than for signal events, and its peak is relatively 
lower compared to the top quark mass search range (as shown in Fig. 22 in Section XI). 

The effect of background on top quark mass extraction is seen in Fig. 11, which shows the 
reconstructed top quark mass from 63-event pseudo-experiments as a function of the back¬ 
ground fraction. The peak mass is shifted lower and the width broadens as the background 
fraction increases. 

It is important to know the effect of each of the backgrounds on the mass determination 
in order to properly account for the background composition uncertainty. Figure 12 shows, 
for 178 GeV/c 2 tt Monte Carlo, how the reconstructed mass is shifted from the input mass 
by individual background sources as the background fraction is varied over the range 0- 
50%. This is done with pseudo-experiments having 63 total events, where the number of 
background events is fluctuated using Poisson statistics. We do not see significant differences 
among the W+ HF, rnistag, and non -W (QCD) samples, which in sum account for more 
than 90% of the background and hence dominate the total background (the solid squares 
in Fig. 12). Thus the size of the mass-shift produced by the background is not sensitive 
to the relative fractions of kF+HF, rnistag and non-IF. On the other hand, the single top 
sample produces a smaller negative shift and diboson events a slightly larger negative shift 
compared to the dominant sources of rri is tag/ IF+HF/ non- IF . Each of these two sources is 
responsible for approximately 5% of the total background. 

In summary, background reduces the likelihood peak mass. We evaluate the size of these 
mass shifts and derive a correction, the “mapping function” discussed in the next section. 
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FIG. 11: An example of mass shift due to background events. The plot shows the reconstructed 
mass (input M top = 178 GeV/c 2 ), varying the background fraction from 0% to 30% (expected 
fraction is 14.5%). For each distribution, 5000 sets of pseudo-experiments that contain 63 events 
each are performed. Each distribution is fitted with a Gaussian function. 

B. The mapping function 

There are two sources that cause the input top quark mass and the reconstructed top 
quark mass to differ. One is the top quark mass dependence of the transfer function, and 
the other is the effect of background. We combine the two effects into a single mass- 
dependent correction factor, the mapping function, which is obtained from many sets of 
pseudo-experiments. Figure 13 shows the reconstructed top quark mass as a function of 
its input mass for various background fractions. The background fraction ranges from 0% 
to 50%, where the relative fraction of each background is that given in Table I. In each 
pseudo-experiment, the number of events from each background source and the total number 
of events are Poisson fluctuated. As one can see in the figure, even with 0% background the 
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FIG. 12: The difference between the reconstructed mass (M rec ) and Mo, the mass at 0% background 
(~ 177.5 GeV/c 2 ), due to individual background sources, using a signal sample of M top =178 
GeV/c 2 , as a function of the background fraction. The closed squares represent the combined 
background using the expected composition from Table I. The expected background fraction of 
14.5% is shown as the dashed line. 

reconstructed top quark mass does not have unit slope. This is due to a small top quark 
mass dependence of the transfer function as well as to the effect of gluon radiation and the 
contamination of the data sample from other top quark decay modes. As expected from 
the background study, the reconstructed top quark mass is shifted lower as the background 
fraction increases. The inset of Fig. 13 shows the slope of the linear fit (po of po • x + p\) 
to the mapping functions as a function of background fraction. One can see very stable 
behavior up to background fraction of 50%. The estimated background fraction of 14.5% is 
used to extract the top quark mass. 
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FIG. 13: The reconstructed mass obtained from the mean of the pseudo-experiments, as a function 
of input mass with the background fraction varying from 0% to 50%. The inset shows the slope 
of the linear fit to the mapping function as a function of the background fraction. The expected 
background fraction of 14.5% in this data sample is shown as the dashed line. 

C. Method Check 

The method described above is tested for possible systematic bias by running large num¬ 
bers of pseudo-experiments using Monte Carlo samples. Each set of 63 events (mean) in a 
pseudo-experiment consists of on average 53.8 signal events and 9.2 background events, with 
each source Poisson fluctuated. For each pseudo-experiment, the fit of the —21nL distribu¬ 
tion provides a measured top quark mass as well as the positive and negative uncertainties 
by fitting with a second order polynomial with different curvature on the positive and neg¬ 
ative sides (four parameters). After applying mapping functions for a 14.5% background 
fraction to each pseudo-experiment, we obtain a slope consistent with unity (0.997 ± 0.006) 
between the input and reconstructed masses. A pull distribution, defined as the input top 
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quark mass minus the reconstructed mass divided by its estimated uncertainty, is generated 
for each of 11 different input top quark mass samples, where each mass point is generated 
from 1000 pseudo-experiments and then is fitted with a Gaussian function to extract the 
center and the width of the pull distribution. The center of the pull distribution is consistent 
with zero (0.015 ± 0.021) as illustrated in Fig. 14. The width of the pull distribution as 
a function of the top quark mass is shown in Fig. 15. It is seen that the pull widths are 
slightly larger than one (1.042 ± 0.014). This is because this technique assumes that all 
events are from tt signal. When backgrounds, other decay channels or extra gluon radiation 
contaminate the data sample, our assumption is violated and the reported uncertainties will 
not necessarily be correct. This effect is observed in the pull width. Therefore we correct 
the final statistical uncertainty in order to have a pull width equal to one, corresponding to 
68% coverage in Gaussian statistics, by scaling the reported uncertainties. The scale factor 
of 1.04 is extracted by fitting the pull width over the full range of true top quark mass. After 
applying the mapping function and scaling the statistical uncertainty, we conclude that the 
top quark mass is reconstructed without bias, over a wide range of input masses. 

X. THE RESULTS FROM THE DATA 

We have 63 tt candidate events passing the event selection criteria. The joint likelihood 
of these events is shown in Fig. 16. From the fit, we obtain M top = 171.8 (stat. only) 
GeV/c 2 , assuming there is no background. We then apply the mapping function to remove 
the mass-pulling effect of the background. Figure 17 shows the extracted top mass as a 
function of the background fraction. The top quark mass changes by +1.4 GeV/c 2 for a 
background fraction of 14.5%. 

For the final result, we use the estimated 14.5% background fraction, which gives M top = 
173.2 l 2 ;® (stat. only) GeV/c 2 . The statistical uncertainty is also scaled by the slope of 
the mapping function mass shift extracted from Fig. 13 and by 1.04 from the pull width in 
Fig. 15. Figure 18 shows the likelihood distribution for each of the 16 data events containing 
two 6-tagged jets. Some of these events have two or three peaks because we sum up all 
combinations, each of which could produce a different maximum likelihood point. For these 
16 events, backgrounds are expected to be small (~1.4 events) since two b jets are tagged. 

To test how likely the reported statistical uncertainty is, we generated a set of Monte 
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FIG. 14: The mean (center) of the pull distribution as a function of the input top quark mass is 
consistent with zero, as shown in the figure. 

Carlo pseudo-experiments at a top quark mass of 172.5 GeV/c 2 (the closest mass sample to 
the measured mass), with the number of events in each subsample equal to that observed 
in the data. Figure 19 shows the expected negative and positive statistical uncertainties. 
The arrows indicate the statistical uncertainties for the fit to the data. The probability of 
having a smaller uncertainty than that from data is estimated to be 19%. 

As a consistency check, the top quark mass is measured using different subsamples to 
ensure the robustness of the final result. The analysis procedure applied to these measure¬ 
ments is the same as the one used for the entire data sample. Figure 20 shows the resulting 
top quark mass for the various categories. Comparisons are made by splitting the events into 
(1) electron and muon channel, (2) lepton charge (±), (3) 1 6-tag and 2 6-tag events, (4) run 
period A which collected data until September 2003 and run range B with data accumulated 
after that date. The corresponding integrated luminosities are roughly the same for the two 
run ranges. The same mapping function is used to estimate the mass in each category using 
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FIG. 15: The width of the pull distribution as a function of the input top quark mass is consistent 
with a horizontal line fit (pO = 1.042), as shown in the figure. 

the expected background fraction of 14.5% except that a background fraction of 9 % (1.4/16) 
is used for 2 6-tag events in category (3). 1.4 and 16 are the expected number of background 
2 6-tag events and the number of 2 6-tag events observed in the data, respectively. Although 
inconsistencies would indicate the presence of new physics in this mode, or perhaps problems 
with the analysis method, the Monte Carlo modeling, or detector performance, all results 
are consistent with each other and with the default measurement. 

XI. CROSS CHECKS 

In order to ensure that the method, calibrated by Monte Carlo samples, describes the 
data correctly as well as to check how well the Monte Carlo itself models the data, we 
compare various variables for the data with the Monte Carlo predictions for combined signal 
and background with regard to (1) the absolute likelihood, (2) the maximum-likelihood top 
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M top (GeV/c 2 ) 


FIG. 16: The joint negative log likelihood distribution of the 63 events observed in the data. The 
fit gives M top = 171.8 I 20 GeV/c 2 , before any corrections. 

quark mass, (3) the maximum-likelihood hadronic W mass, and (4) transfer functions. The 
normalization of these comparisons is done in the same way, using the expected numbers of 
events of 9.2 for background and 53.8 for the signal, giving the observed 63 events in total. 

A. Absolute Likelihood Value 

Although the absolute value of the likelihood in DLM is arbitrary, we can compare the 
Monte Carlo with the data. The signal likelihood for the i-th event is defined as 

LL„, = [ L‘(M)dM, (39) 

where the integration is over the search region 155-195 GeV/c 2 . Figure 21 shows the com¬ 
parison of the log of the event likelihoods in the data and the Monte Carlo samples. We 
find good agreement between the data and Monte Carlo. 
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FIG. 17: Extracted top quark mass using the mapping function as a function of the background 
fraction. 

B. Maximum Likelihood Top Quark Mass 

A second check uses the event-by-event maximum likelihood mass. We show this quantity 
for each event in Fig. 22. The signal Monte Carlo sample used for the comparison is generated 
with M top = 172.5 GeV/c 2 , close to the central value from the data. The combined background 
distribution has a peak around 150-160 GeV/c 2 , while the signal events peak at the input 
value of 172.5 GeV/c 2 . The Monte Carlo prediction agrees well with the data. 

C. Hadronic W Mass (W —► jj ) 

We assume that the top quark always decays to a b quark and a real W boson. Therefore 
in the top quark mass likelihood, we fix the W mass at 80.4 GeV/c 2 . To check this, we 
remove the constraint in the likelihood on the mass of the W that decays into two jets and 
instead constrain the top quark mass to 172.5 GeV/c 2 . Then in each event, the invariant 
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FIG. 18: Event likelihood distributions as a function of the top quark mass for the 16 double 
6-tagged events in the data. 

mass of the two jets assigned to the W at the maximum likelihood point is plotted. Figure 23 
shows the comparison between the data and Monte Carlo. We conclude that the dijet mass 
is consistent with that expected from Monte Carlo tt events. 

D. Validation of Transfer Functions 

The transfer function is checked by comparing the data and the simulation directly. This 
is important because we rely on the Monte Carlo simulation for the relation between partons 
and jets. The energy scale of the jets is understood to ~ 3%, with possible biases taken into 
account through the systematic uncertainty on the top quark mass. However the resolution 
and even the scale itself for this specific physics process should be checked. To do this, 
the response variable £ is selected at the maximum likelihood point for each event. Since 
each time the likelihood is calculated, we assign which jet corresponds to which parton, 
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FIG. 19: The expected positive and negative statistical uncertainties from pseudo-experiments 
using 172.5 GeV/c 2 tt Monte Carlo samples with the same number of events as in the data. The 
arrows indicate the positive and negative uncertainties for the data. 19 % of the pseudo-experiments 
have smaller uncertainties than those in the data. 

we can extract the response variables for “jets assigned as b quarks” and “jets assigned as 
W daughter jets”. These distributions will of course include mis-assignments and gluon 
contamination, but by comparing the Monte Carlo and the data directly, it is possible to 
check whether the transfer functions are well modeled. Monte Carlo studies have shown 
that the mean value of the £ distribution is slightly different for signal and background, 
and the resolution of the background is much wider than for the signal sample. The direct 
comparisons between data and MC are shown in Figs. 24 and 25 for b jets and W jets 
respectively. Since in each event there are two b jets and two W jets, the number of data 
entries in these plots is twice the number of events (63). As a summary, the mean and 
RMS are listed in Table II. The good agreement indicates that the jet energy scale is well 
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FIG. 20: Consistency checks: Comparisons between (1) electron and muon channel, (2) lepton 
charge (±), (3) one 6-tag and two 6-tag events, (4) run period A which collected data until Septem¬ 
ber 2003 and run range B which is after September 2003. The corresponding integrated luminosities 
are roughly the same between the two run ranges. Each point includes the statistical uncertainty 
only. 

calibrated and no additional systematic uncertainty is needed beyond those from generic jet 
energy corrections. This test has the potential to further constrain the jet energy scale. In 
the future, as the integrated luminosity increases, we can use this together with the hadronic 
W —> jj mass to reduce the jet energy scale uncertainty. Indeed, CDF has recently used 
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FIG. 21: Event likelihood distribution. The number of signal and background events is normalized 
to 63, the number of observed events. The Monte Carlo signal, background, and the combined 
predictions are shown as histograms. The triangles are the 63 data events. 


TABLE II: Summary of the mean and RMS of the response variables £ for the data and the Monte 
Carlo in Fig. 24 and 25. 




b jet 


W jet 


Mean 

RMS 

Mean 

RMS 

MC 

0.044T0.002 

0.264T0.002 

0.012T0.002 

0.280T0.002 

Data 

0.039T0.022 

0.263T0.018 

0.022T0.026 

0.281T0.020 


the dijet mass (hadronic W mass) to reduce the jet energy scale systematic uncertainty in 
the template top quark mass analysis [16]. 
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FIG. 22: The maximum likelihood mass for each event in the data compared to Monte Carlo. The 
signal Monte Carlo sample is for Mt op = 172.5 GeV/c 2 . 

XII. THE SYSTEMATIC UNCERTAINTY 

We have performed a number of studies of systematic uncertainties. For each source 
of uncertainty, we change the input sample and estimate the impact on the reconstructed 
top quark mass based on a number of pseudo-experiments using Monte Carlo simulations 
where the input top quark mass is the Run I Tevatron average, 178 GeV/c 2 [53]. The 
reconstructed mass from each input sample for the various systematic sources is calculated 
by the same procedure as applied to the data sample; i.e., likelihood computations, followed 
by the mapping function for a background fraction of 14.5%. These masses are compared 
to the nominal mass from HERWIG or PYTHIA, depending on the source. The shift in 
the mean from a Gaussian fit over a large number of pseudo-experiments is taken as the 
systematic uncertainty. 
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FIG. 23: Dijet (W —> jj ) invariant mass distribution for the maximum likelihood solution for 
signal (Mto P = 172.5 GeV/c 2 ) and background, normalized to the expected number of events. The 
triangles show the 63 data events. 

A. Jet Energy Scale 

With regard to the jet energy corrections, we consider three systematic sources: first the 
generic corrections calibrated by the QCD dijet process, second the transfer functions for b 
and W daughter jets from top decay, and third the 6-jet energy scale. 

First we evaluate the impact on the top quark mass from systematic uncertainties in the 
generic jet energy corrections. The details of the generic jet energy corrections are described 
in Section IVB. The relative, absolute energy scale (hadron jet modeling), and out-of¬ 
cone corrections have uncertainties of roughly 1%, 2%, and 2.5%, respectively. We apply a 
±lcr shift to both signal and background events and make event selection cuts on the shifted 
samples. The reconstructed masses are then calculated by the DLM procedure. We take half 
the difference between the means of the ±la distributions. Table III lists the uncertainties 
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FIG. 24: Comparison of the 6-jet response variable £ between the data (triangles) and the simu¬ 
lation (histograms show signal, background, and total(= signal ± background)). The means and 
resolutions are summarized in Table II. 

TABLE III: The systematic uncertainties on the top quark mass for each jet energy systematic 


source. 

Jet Energy Systematic A M top GeV/c 2 

Response relative to central scale 0.6 

Modeling of hadron jets (absolute scale) 2.0 

Modeling of parton showers (out-of-cone) 2.2 

Total systematic due to jet energy scale 3.0 


from individual corrections. The total uncertainty is taken to be the quadrature sum of 
these uncertainties and is found to be ±3.0 GeV/c 2 . 

Second is the systematic uncertainty from modeling of the transfer functions. In Sec¬ 
tion XI, the TF is checked by comparing the Monte Carlo simulation with the data and found 
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FIG. 25: Comparison of the W -jet response variable £ between the data (triangles) and the simu¬ 
lation (histograms show signal, background, and total(= signal + background)). The means and 
resolutions are summarized in Table II. 

to be consistent. Therefore we only account for the difference of TF’s between PYTHIA 
and HERWIG. We make two sets of TF’s, one each from PYTHIA and HERWIG. They are 
applied to the same Monte Carlo sample, HERWIG with M top = 178 GeV/c 2 . The difference 
between the two is found to be 0.2 GeV/c 2 . 

The last systematic related to the jet energy scale arises from the 6-jet specific energy 
scale. The light quark jet scale is set by the generic corrections which are deduced using 
samples that are mainly light quark and gluon jets. In addition, the sensitivity of the 
top mass to the light quark jet energy scale is reduced by the W mass constraint in the 
likelihood. On the other hand, the top quark mass is very sensitive to the 6-jet energy scale, 
so its additional uncertainty has to be estimated. We consider three possible sources: (1) 
6-quark decay properties, (2) fragmentation properties, and (3) different color flow. 

The B meson semi-leptonic branching ratios are varied in the simulation by 3% (30 ± 
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3%), corresponding approximately to the uncertainty in the current world average [53], to 
estimate its impact on the 5-jet energy scale. We find that the total uncertainty on 6-jet 
response is 0.4%, which translates to a top quark mass difference of 0.4 GeV/c 2 . Using the 
LEP [54, 55] and SLC [56] results from large Z —> 66 datasets, we constrain the possible 
fragmentation models in Monte Carlo calculations by changing the Peterson parameter [57] 
to match the experimental results within their uncertainties. The variations introduce an 
additional uncertainty of ±0.4 GeV/c 2 . For color flow modeling, we vary the parameters of 
the algorithms used to generate color flow in both PYTHIA and HERWIG. The amount of 
ambiguous energy, i.e., energy that cannot be assigned to the 6 jet or the initial state parton 
due to the color connection, is estimated to be 3% of the 6-jet energy scale. By considering 
large variations of the parameter related to color flow modeling, the amount of ambiguous 
energy changes by 0.3% of the total 6-jet energy, corresponding to ±0.3 GeV/c 2 in the top 
quark mass. 

These three contributions are added in quadrature, and the resulting ±0.6 GeV/c 2 is 
assigned as an additional systematic uncertainty due to the modeling of the 6-quark energy 
scale. 

B. Initial and final state hard radiation 

Initial and final state gluon radiation (ISR and FSR) affect the top quark mass measure¬ 
ment. ISR produces extra jets that can be misidentified as a tt daughter, while FSR can 
cause a final state quark jet energy to be measured low. To evaluate the level of ISR, Drcll- 
Yan dilepton events (ee and /x/x) are used since there is no FSR and they are produced via 
qq annihilation, the dominant production mechanism for tt at the Tevatron (85% at NLO). 
The average dilepton p t, {pt), which reflects the size of ISR activity, is shown in Fig. 26 as 
a function of the dilepton mass squared. A logarithmic dependence is seen between the two. 
By extrapolating to the energy scale of tt production, we find the allowed range for (p T ). 
Two PYTHIA Monte Carlo samples are made with parameters adjusted to cover the range: 
one with A qcd — 73 MeV, K = 2.0 and the other with A qcd = 292 MeV, K = 0.5 for 
— 1 (Jisr and ±1 ajsR , respectively, where K is a scale factor applied to the transverse mo¬ 
mentum scale. Corresponding curves are also shown in Fig. 26. This yields an uncertainty 
of ±0.4 GeV/c 2 . Since both ISR and FSR are controlled by the same DGLAP evolution 
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equation, the same variations of A qcd and K are used to generate FSR systematic samples 
by varying only FSR modeling. This results in a ±0.5 GeV/c 2 variation in the top quark 
mass. 



FIG. 26: The average pt of dilepton events shows a logarithmic dependence on the dilepton 
invariant mass squared. The data are compared with PYTHIA samples created with nominal 
settings as well as those with increased and decreased ISR activity. 


C. Parton Distribution Functions 

For the parton distribution functions, we add in quadrature uncertainties derived from 
two sources: differences from 20 pairs of CTEQ6M [58] uncertainty eigenvectors (±lcr), and 
MRST [59] with two different A qcd values (300 and 228 MeV). The result is an uncertainty 
of ±0.5 GeV/c 2 , of which 0.45 GeV/c 2 comes from the 20 eigenvectors. 
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Both the PDF and ISR systematics, which impact the px of the tt system, reflect the 
sensitivity of the DLM method to the production mechanism. To make an extreme test 
of this, we created a signal Monte Carlo sample of tt resonance production in which 175 
GeV/c 2 top quarks are produced from the decay of a 700 GeV/c 2 resonance. In this sample, 
the top quark decay properties are the same as those in the SM. The shifted mass is found to 
be 2.0 GeV/c 2 , demonstrating that the method is relatively insensitive to major variations 
in the production mechanism even though we use the SM tt production matrix element in 
the event likelihood. This is because the most sensitive factor in the likelihood is the top 
quark propagator rather than the production and decay matrix elements. 

D. Other Systematic Uncertainties 

Possible bias in the Monte Carlo generator is estimated by comparing PYTHIA and 
HERWIG. HERWIG deals with spin correlations in the production and decay of tt, while 
PYTHIA does not. Another difference between the two is the fragmentation model, where 
PYTHIA uses the string model while HERWIG adopts the cluster model. We estimate 
the associated uncertainty to be ±0.3 GeV/c 2 by taking the difference of the reconstructed 
masses between PYTHIA and HERWIG using the same mapping function extracted from 
HERWIG. Another systematic uncertainty comes from the mapping function, for which we 
use a background fraction of 14.5%. The uncertainty on this fraction is ±2.9% from the 
uncertainty in the mean expected background of ±1.8 events as shown in Table I. From 
a series of pseudo-experiments by changing background fraction by ±2.9%, we estimate 
this uncertainty to be ±0.2 GeV/c 2 . The statistical uncertainty on the expected number 
of background events (9.2) is already taken into account by the correction obtained from 
the width of pull distribution discussed in Section IX C because the expected number of 
background events has been Poisson fluctuated in the pseudo-experiments. 

The uncertainty due to background modeling, ±0.4 GeV/c 2 , comes from two sources: 
We evaluate the difference between the reconstructed masses obtained by using only one 
of the individual background process, rather than using combined background. Then the 
maximum difference among the major background sources (W± heavy flavor quarks, W+ 
mistagged jets, non-IF background) is used. The other source is the variation with different 
choices of the Q 2 scale (4M^, M^, M^/4, and M^ ± P^w) which is the characteristic 
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TABLE IV: The summary of systematic uncertainties. 


Source 

A M top GeV/c 2 

Jet Energy Corrections 

3.0 

Transfer Function 

0.2 

ISR 

0.4 

FSR 

0.5 

PDFs 

0.5 

Generator 

0.3 

Background Fraction 

0.2 

Background Modeling 

0.4 

b Jet Energy Modeling 

0.6 

b Tagging 

0.2 

Total 

3.2 


energy scale of the hard scattering process using the ALPGEN Monte Carlo program. This 
takes into account possible variations in the background composition. Finally, as described 
in Section IV, the 6-tagging efficiency is different in data and Monte Carlo. Only the jet 
Et dependence of the tagging efficiency is important in the mass analysis. By varying the 
slope of the efficiency as a function of Et by ±lcr, we ford the top quark mass shifts by 
±0.2 GeV/c 2 . The uncertainty due to the finite statistics of the non-IT data sample and 
the Monte Carlo samples used to make the mapping functions are negligible. 

E. Summary of systematic uncertainties 

The systematic uncertainties are summarized in Table IV. The largest one comes from 
the uncertainty in the jet energy measurement. The sum in quadrature of all the systematic 
uncertainties is 3.2 GeV/c 2 . 
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XIII. CONCLUSION 


Using the dynamical likelihood method, we measure the top quark mass to be 

M top = 173.2 +I 4 (stat.) ± 3.2 (syst.) GeV/c 2 
= 173.2 l£o GeV/c 2 

from 63 events, corresponding to an integrated luminosity of 318 pb -1 accumulated in the 
CDF Run II experiment. By using the maximal information from the tt production mech¬ 
anism and assuming the validity of the SM, a reduction of the statistical uncertainty is 
obtained. The precision of this single measurement in fact is slightly better than the Run 
I world average, and the result is consistent with other recent measurement by CDF [16], 
which provided the best single measurement (173.5 + 3 '® GeV/c 2 ) using the template tech¬ 
nique with a dijet W mass constraint to reduce the jet energy scale uncertainty. The current 
DLM analysis technique uses the jet energy scale determined with generic jet samples. How¬ 
ever as the luminosity increases, a reduction of the dominant systematic uncertainty, due to 
the jet energy scale, is crucial. DLM will be able to further constrain the jet energy scale 
using the hadronic W —> jj mass in tt events as done in [16]. We expect that other systern- 
atics also can be improved as the size of control samples increase. A reduced top quark mass 
uncertainty with increased data sample size will contribute to the detailed understanding of 
the electroweak interaction as well as to the search for physics beyond the standard model. 
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